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AMENDMENTS TO THE CLAIMS 

1 . (Previously Presented) A method for editing media in an electronic apparatus with 
digital audio/video processing capability, comprising: 

receiving audio data and a plurality of associated audio descriptors, which describe 
characteristic of said audio data, from an audio source connecting to said electronic apparatus; 

receiving visual data included a plurality of visual segments, and a plurality of associated 
visual descriptors, which describe characteristic of said visual data, from a video source 
connecting to said electronic apparatus; 

determining a plurality of corresponding weights for said visual data, said weights 
indicating qualities, importance, or preference of said visual data; 

correlating said audio data and said visual data based on said corresponding weights, said 
associated audio descriptors, and said associated visual descriptors; and 

adjusting said audio data and said visual data to construct a media output; 

wherein the receiving audio data and said associated audio descriptors comprises: 

receiving an audio signal from said audio source; and 

generating a plurality of audio indices by choosing said audio signal with audio change 

therein; 

wherein the step of correlating further comprises: 
finding a duration on each said visual segment; 
determining a searching window based on said duration; 

finding, within said searching window, a first index on said audio indices, wherein said 
first index is more than other indices on said audio indices within said searching window; and 
adjusting each said visual segment, based on a time corresponding to said first index. 

2. (Previously Presented) The method according to claim 1, further comprising 
rendering said media output with style information to a device with at least one audio/ video 
output built in or connecting to said electronic apparatus. 
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3. (Previously Presented) The method according to claim 1, wherein the step of 
receiving audio data and said associated audio descriptors comprises: 

receiving an audio signal from said audio source; and 

analyzing and segmenting said audio signal for generating said audio data and said 
associated audio descriptors, wherein said audio data consists of a plurality of audio segments. 

4. (Cancelled) 

5. (Previously Presented) The method according to claim 4, wherein the step of 
determining a plurality of corresponding weights comprises calculating any said corresponding 
weight for respective said visual segment. 

6. (Previously Presented) The method according to claim 5, wherein the step of 
correlating comprises: 

extracting an audio duration, from said associated audio descriptors, for respective said 
audio segment ; 

extracting a visual duration, from said associated visual descriptors, for respective said 
visual segment; 

evaluating a plurality of correlating scores for respective sequences of said visual 
segments, based on said corresponding weights, said corresponding audio durations and said 
corresponding visual durations; and 

finding a sequence of visual segments with a correlating score that is the maximal within 
said plurality of correlating scores. 

7. (Cancelled) 

8. (Cancelled) 
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9. (Previously Presented) A production method of media output in an electronic 
apparatus with digital audio/video processing capability, comprising: 

receiving audio segments and a plurality of associated audio descriptors, which describe 
characteristic of said audio segments, from an audio source connecting to said electronic 
apparatus; 

receiving visual segments and a plurality of associated visual descriptors, which describe 
characteristic of said visual segments, from a video source connecting to said electronic 
apparatus; 

determining a plurality of corresponding weights for each said visual segment, said 
weights indicating qualities, importance, or preference of said visual segment; 

extracting a visual duration, from said associated visual descriptors, for each said visual 
segment; 

extracting an audio duration, from said associated audio descriptors, for each said audio 
segment; 

evaluating a plurality of correlating scores for respective sequences of said visual 
segments, based on said corresponding weights, said corresponding audio durations and said 
corresponding visual durations; 

finding a sequence of visual segments with a correlating score that is the maximal within 
said plurality of correlating scores; and 

adjusting said audio segments and said visual segments to generate a media output. 

10. (Previously Presented) The production method according to claim 9, further 
comprising rendering said media output with style information to a device with at least one 
audio/video output built in or connecting to said electronic apparatus. 

11. (Previously Presented) The production method according to claim 9, wherein the 
step of receiving audio segments and associated audio descriptors comprises: 

receiving an audio signal from said audio source; and 
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analyzing and segmenting said audio signal for generating said audio segments and said 
associated audio descriptors. 

12. (Previously Presented) The production method according to claim 9, wherein the 
step of receiving visual segments and associated visual descriptors comprises: 

receiving an video signal from said video source; and 

analyzing and segmenting said video signal for generating said video segments and said 
associated visual descriptors. 

13. (Previously Presented) The production method according to claim 9, wherein said 
visual segments and said associated visual descriptors are in format of MPEG-7. 

14. (Previously Presented) The production method according to claim 9, wherein said 
audio segments and said associated audio descriptors are in format of MPEG-7. 

15. (Currently Amended) A production method of media output in an electronic 
apparatus with digital audio/video processing capability, comprising: 

receiving audio data and a plurality of associated audio descriptors, which describe 
characteristic of said audio data, from an audio source connecting to said electronic apparatus; 

receiving visual data and a plurality of associated visual descriptors, which describe 
characteristic of said visual data, from a video source connecting to said electronic apparatus* 
and wherein said visual data comprises a plurality of visual segments; 

finding a duration on each said visual segment: 

determining a searching window based on said duration: 

determining a plurality of corresponding weights for said visual data, said weights 
indicating qualities, importance, or preference of said visual segment; 

finding, within a within said searching window, a value corresponding to said associated 
audio descriptors on said audio data, wherein said value is more than other value corresponding 
to associated audio descriptors within said searching window; and 
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adjusting said visual data, based on a time corresponding to said value, to generate a media 
output, wherein said media output is based on audio data and said adjusted visual data. 

16. (Previously Presented) The production method according to claim 15, further 
comprising rendering said media output with style information to a device with at least one 
audio/video output built in or connecting to said electronic apparatus. 

17. (Previously Presented) The production method according to claim 15, wherein said 
visual data and said associated visual descriptors are in format of MPEG-7. 

18. (Previously Presented) The production method according to claim 15, wherein said 
audio data and said associated audio descriptors are in format of MPEG-7. 

19. (Previously Presented) The production method according to claim 15, wherein the 
step of receiving said audio data and said associated audio descriptors comprises: 

receiving an audio signal from said audio source; and 

generating a plurality of audio indices by choosing said audio signal with audio change 

therein. 

20. (Canceled) 

21. (Currently Amended) A computer readabl e computer editing system for audio/video 
comprising: 

means for receiving audio data and a plurality of associated audio descriptors, which 
describe characteristic of said audio data, from an outside audio source connecting to said 
computer editing system; 

means for receiving visual data and a plurality of associated visual descriptors, which 
describe characteristic of said visual data, from an outside video source connecting to said 
computer editing system; 
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means for a weighting processing device for determining a corresponding weight for 

inputted visual segmen t by a processor in said computer editing system , said weight indicating 
qualities, importance, or preference of said visual segment; 

means for an e xtracting d e vice for extracting a visual duration, from said associated 
visual descriptors, for each said visual segment; 

means for an extracting d e vice for extracting an audio duration, from inputted associated 
audio descriptors, for each said audio segment; 

means for a e valuating devic e for evaluating a plurality of correlating scores for 
respective sequences of said visual segment s by said processor in said computer editing system , 
based on said corresponding weights, said corresponding visual durations and said corresponding 
audio duration; 

means for a searching devic e for finding a sequence of visual segments with a correlating 
score that is the maximal within said plurality of correlating scores; and 

means for a construction device for adjusting said audio segments and said segments to 
generate a media output. 

22. (Currently Amended) A computer r e adabl e computer editing system for audio/video 
comprising: 

means for receiving audio data and a plurality of associated audio descriptors, which 
describe characteristic of said audio data, from an outside audio source connecting to said 
computer editing system; 

means for receiving visual data and a plurality of associated visual descriptors, which 
describe characteristic of said visual data, from an outside video source connecting to said 
computer editing system: 

means for finding a duration on each said visual data: 

means for determining a searching window based on said duration by a processor in said 
computer editing system: 
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means for a weighting proc e ssing device for determining a plurality of corresponding 
weights for inputted visual dat a by said processor in said computer editing system , said weights 
indicating qualities, importance, or preference of said visual data; 

means for a s e arching devic e for finding, within aw ithin said searching window, a value 
corresponding to inputted associated audio descriptors on inputted audio data by said processor 
in said computer editing system, wherein said value is more than other value corresponding to 
said associated audio descriptors within said searching window; and 

means for a construction d e vice for adjusting said visual data, based on a time 
corresponding to said value, to generate a media output, wherein said media output is based on 
said audio data and said adjusted visual data. 
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